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Abstract. We give a new algorithmic solution to the well-known five-point 
relative pose problem. Our approach does not deal with the famous cubic 
constraint on an essential matrix. Instead, we use the Cayley representation 
of rotations in order to obtain a polynomial system from epipolar constraints. 
Solving that system, we directly get relative rotation and translation parame- 
ters of the cameras in terms of roots of a 10th degree polynomial. 



1. Introduction 

In the paper presented we give an algorithmic solution to the 5-point 2-view 
relative pose problem. It is formulated as follows. 

Problem 1. We are given two calibrated pinhole cameras with centers Oi, O2 
and five points Qi, . . . , lying in front of the cameras in 3-dimensional Euclidean 
space, see Figure[TJ In every camera coordinate frame the directing vectors of OjQi 
are only known. The problem is in finding the relative position and orientation of 
the second camera with respect to the first one. 




Figure 1. To formulation of the five-point relative pose problem 

The 5-point relative pose problem is a key to the 3d scene reconstruction problem, 
which is in turn used in many computer vision applications such as augmented 
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reality, self-parking systems, robot path-planning, navigation, etc. It is well known 
that 5-point algorithms yield significantly better results in accuracy and reliability 
than 6-, 7- and 8-point algorithms. Moreover, for planar and near-planar scenes only 
5-point method allows to get a robust solution without any additional modification 
of the algorithm. 

Problem[I]was first shown by Kruppa [8] in 1913 to have at most eleven solutions. 
Using the methods of projective geometry, he proposed an algorithm for solving the 
problem, although it could not lead to a numerical implementation. Demazure [5] , 
Faugeras and Maybank HI, Heyden and Sparr 6 then sharpened Kruppa's result 
and proved that the exact number of solutions (including complex) is ten. 

More efficient and practical solution has been presented by Philip [13] in 1996. 
His method requires to solve a 13th degree polynomial. In 2004 Nister improved 
Philip's algorithm and expressed a solution in terms of a real root of 10th degree 
polynomial. Afterwards, there were presented many modifications of that algorithm 
simplifying its implementation [lOJ or making it more numerically stable |9l [15] . 

In this paper we give yet another algorithmic solution to the problem using the 
well-known Cayley representation of rotation matrices pLJ. Our approach does not 
mix rotation and translation parameters of an essential matrix and nevertheless 
allows one to express a solution in terms of a root of lOth degree univariate poly- 
nomial. Experiments on synthetic data show that the method is comparable in 
accuracy with the existing five-point solvers. 

The rest of the paper is organized as follows. In Section [2] we describe in detail 
our algorithm. In Section[3]we make a comparison of our algorithm with the original 
Nister solver [12] on synthetic data. Section |4] concludes. 

1.1. Notation. We use a, b, . . . for column vectors, and A, B, . . . for matrices. For 
a matrix A, the entries are Aij, the transpose is A"^, the trace is Tr(A), and the 
determinant is det(A). For two vectors a and b, the vector product is ax b, and the 
scalar product is a'^b. For a vector a, the notation [a] x stands for a skew-symmetric 
matrix such that [a]xb a x b for any vector b. 

We use I for identical matrix and for zero matrix or vector, || ■ || for the 
Frobenius norm. 



2.1. Initial data transformation. Initial data for our algorithm are the homo- 
geneous coordinates Xji, yji, Zji of points Qi in the coordinate frame of jth camera, 
j = 1, 2, i = 1, . . . , 5 (see Figured]). 

Without loss of generality we can set Xji ~ yji = Xj2 = for j — 1,2. The 
numerically stable way of doing this is as follows. We combine the initial data into 
two 3x5 matrices 



2. Description of the algorithm 




a; J 5 

Zj5, 



(1) 



and compute the matrices 




Hj2HjiAj, 



(2) 
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where Hji and Hj2 are the Householder matrices zeroing xji, yji and Xj2 respec- 
tively. The corresponding Householder vectors are 



Xji 



Zji + sign(zji)yx2~ry|7+4i" 



^2 



i2 




We will see that transformation ([2]), being quite simple, noticeably simplifies our 
further computations. In particular, this will allow us to easily convert the resulting 
20th degree univariate polynomial ([T^ to the 10th degree polynomial ([T^ . 

2.2. Epipolar constraints and essential matrix. We first recall some defini- 
tions from multiview geometry, see [31 [51 111] for details. A pinhole camera is a 
triple (0,7r, P), where tt is an image plane, P is a central projection of points in 
3-dimensional Euclidean space onto tt, and O is a camera center (center of projec- 
tion P). The focal length is the distance between O and tt, the orthogonal projection 
of O onto TT is called the principal point. A pinhole camera is called calibrated if all 
its intrinsic parameters (such as focal length and principal point's coordinates) are 
known. 

Let there be given two calibrated pinhole cameras {Oj, ttj, Pj), j = 1, 2. Without 
loss of generality we can set Pi = [l O] , P2 = [R t] , where R G S0(3) is the 

rotation matrix and t — \ti t2 ^3] is the translation vector normalized so that 
l|t|| = l. 

The well-known epipolar constraints [S] on R and t read: 



y2i 



Z2^] E 



yii 



0, 



(3) 



where E = [t] x R is called the essential matrix. 



2.3. Ten fourth degree polynomials. Our approach is based on the following 
well-known result. 

Theorem 1 ([I]). If a matrix R G S0(3) is not a rotation through the angle ir+iTik, 
k G Z, about certain axis, then R can be represented as 

-1 



R 



I 




(4) 



where u,v,w E M. 

Let R be represented by ^ and E(u,i;, w,t) = [t]xR be an essential matrix. 
Proposition 1. // 



-tl 


- Vts + Wt2 




5 


-t2 


— wti + ut^ 




S 


-h 


- Ut2 + Vti 


s 



(5) 



where S — uti + w<2 + wtz, then E(u', v' , w' , t) = — E(u, u, w, t). 
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Proof. Consider a matrix R' = — HtR G S0(3), where the Householder matrix 
Ht = I — 2tt^ . Then, E' = [t]xR' = — E. By a straightforward computation, the 
equation R'(m', w', w') — — HtR(u,u,ui) has a unique solution ([5]). □ 

Since epipolar constraints ([3]) are linear and homogeneous in t, we can rewrite 
them as 

St = 0, 

where the ith row of 5 x 3 matrix S is 



(6) 



X2i 

y2i 

Z2i 



Now we represent rotation R in form ([4]) and take the determinants of all 3 x 3 
submatrices of matrix S. This yields ten polynomial equations: 

+ [l]uv^ + + [2]u^ + [2]uv + [2]i;2 + [3]u + [i]v + [4] = 0, (7) 

where i — 1, . . . , 10, [n] means a polynomial of degree n in the variable w, [0] is a 
constant. 

Remark 1 . Actually, the determinants of 3 x 3 submatrices of S give the following 
expressions: 

A3' 

where A = 1 + + + and Fi is a polynomial of 6th total degree. However, 
one can verify that Fi is factorized as Fi = fiA and the coefficients of fi are easily 
deduced from the coefficients of Fi. 

2.4. Tenth degree univariate polynomial. Let us rewrite system ([7]) in form 

Bm = 0, (8) 

where B is a 10 x 35 coefffcient matrix and 



m 



V w 1 



is a monomial vector. 

We expand system ([5]) with 20 more polynomials ufi, vfi for i = 1, . . . , 5, and 
wfi for i = 1, . . . , 10. Thus we get 



B' 



m 
m 



0, 



(9) 



where B' is a new 30 x 50 coefficient matrix and 



r4 3 3222 2 223 

m — [u WjU vw,u w ,u V w,u vw ,u w , 



is the five-degree monomial vector. It is clear that system ([9]) is equivalent to ([8]). 

We rearrange columns of matrix B' and perform Gauss- Jordan elimination with 
partial pivoting on it. Then the last six rows of the resulting matrix can be repre- 
sented in form 
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u 


V 


1 


91 


1 


[3] 


[4] 


[4] 


[5] 


92 




1 [3] 


[4] 


[4] 


[5] 


93 




1 [3] 


[4] 


[4] 


[5] 


94 




1 [3] 


[4] 


[4] 


[5] 


95 




1 [3] 


[4] 


[4] 


[5] 


96 




1 [3] 


[4] 


[4] 


[5] 



where empty spaces are occupied by zeroes. Also, we have omitted first 28 zero 
columns. From the corresponding six polynomials gi, . . . , gg we obtain the following 
four polynomials 



hi 




91 




92 


h2 

h3 




92 

94 


— w 


93 
95 


hi 




95 




96 



uv 
u 

V 

1 



(10) 



where matrix C(u)) can be represented as 

C{w) 



[4] 


[5] 


[5] 


[6 


[4] 


[5] 


[5] 


[6 


[4] 


[5] 


[5] 


[6 


[4] 


[5] 


[5] 


[6 



(11) 



Remark 2. Since we use only six last rows of matrix B', there is no need to perform 
a "complete" Gauss- Jordan elimination on matrix B'. For the first 24 rows of B' 
only lower triangular entries should be zeroed. 

Denote by W = det C{w). In general, it is a 20th degree polynomial in w. 

Proposition 2. Polynomial W has a special symmetric form: 



10 



,10+fe 



+ {-w) 



lO-fel 



(12) 



where G M. 

Proof. Due to the conditions Xji = yji = 0, we have E33 — 0. As a consequence. 



R23 



t 



13 



vw + u 



uw — V 



Substituting this into the last identity in ([5]), we get w' ~ —w ^. Thus, if Wi is a 
root of W, then so is —w^^. It follows that 



10 



10 



W = Pio - Wi){w + ^) = [w 



10+k 



+ {-w) 



10-fci 



k=0 



Substituting w = w — w ^, we transform W to a 10th degree polynomial 

10 

k=0 



□ 



(13) 
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k 

where pk can be deduced using the formula w'' = J2 {-!)' {'l)w'^'-'' . The result 



reads 

10 



i=0 



■ / i+k 1 \ 

^'==E U y (14) 

i=k '^^ ~ / 

where the primed sum is taken over all i from fc to 10 such that i — k mod 2 = 0. 

10 

2pi. 



2.5. Structure recovery. A complex root of W leads to a complex root of W and 
by (jH) to complex rotation matrix having no geometric interpretation. Hence only 
real roots of W must be treated. 

Real roots of VV can be efficiently found first using Sturm sequences [7] for 
isolating and then Ridders' method [14j for polishing. Then we can recover the 
second camera matrix applying the following algorithm. 

Let wq be a real root of W. First we find the value 



Wo = wo/2 + sign(wo)\/(wo/2)2 + 1, 

which is a root of W subject to \wq\ > 1. After that, we obtain the u- and v- 
components of the solution by applying Gaussian elimination with partial pivoting 
on matrix C(u;o) in (|lip . 

Then we find the entries of R by ([4]) . Given R, the translation vector t can be 
found by performing Gaussian elimination with partial pivoting on matrix S(uo: ^O; ^o) 
in (IH). Here we have also taken into account the normalization constraint ||t|| — 1. 

Let Ht = I - 2tt'^ and R' = -HtR. It is well-known [51 [H] that there are 
four possibilities for the second camera matrix: — [^^ t] , P_b = [R — t] , 
Pc = [R' t] and = [R' — t] . The only of these matrices is correct, all 
others correspond to unfeasible configurations. 

The true second camera matrix P2 can be derived from the so-called cheirality 
constraint saying that all the scene points must be in front of the cameras. In 
particular, this is valid for the first scene point Qi. Denote by 

Cl = -4^ = -4^, C2 = Cii?33 -I- is. (15) 
K23 

Then, 

• if ci > and C2 > 0, then P2 = Pa', 

• else if ci < and C2 < 0, then P2 ~Pb', 

• else if c[ > and c'2 > 0, then P2 = Pc! 

• else P2 = P_D. 

Here the value c'l and C2 are computed in the same manner as ci and C2 in (jl5p 
with R being replaced by R'. 

Finally, the initial second camera matrix is given by 



H12H11 
1 



Pr= (H22H2l)^P2 

where the Householder matrices Hji and Hj2 are defined in Subsection l2.1l 
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-14 -12 -10 -8 -6 -4 

logjjj of numerical error 

(a) Default conditions. The median error is 
1.56 X 10"" for Nister and 2.94 X 10-^° for 
NewSpt 




-14 -12 -10 -S -6 -4 -2 

log, „ of numerical error 



(b) Planar scene and forward motion. The 
median error is 1.52 X 10"'^ for Nister and 
7.17 X 10^3 for New5pt 



Figure 2. Numerical error distribution 



3. EXPERIIVIENTS ON SYNTHETIC DATA 

In this section we compare our algorithm with the original 5-point solver by 
Nister [151 on synthetic data. The C/C++ implementations of both algorithms 
have been written. All computations are performed in double precision. Synthetic 
data setup is the same as in |12) : 



Distance to the scene 


1 


Scene depth 


0.5 


Baseline length 


0.1 


Image dimensions 


352 X 288 


Field of view 


45 degrees 



The numerical error is defined by 

£=||P2 



(16) 



where P2 is the ground truth second camera matrix. 

The numerical error distributions are reported in Figure [21 The total number of 
trials is 10^ in each experiment. We have compared the algorithms first in case of 



default conditions (Figure 2(a) I and second in the most problematic case in sense 
of numerical stability — planar scene and forward motion (Figure |2(b)| . 

In Figure |3] we demonstrate the behaviour of the algorithms under increasing 
image noise. We add the Gaussian noise with a standard deviation varying from 
to 1 pixel in a 352 x 288 image. One sees that in presence of noise the results of 
both algorithms are almost coincident. 



4. Discussion of results 



A new algorithm for the 5-point relative pose problem is presented. A compu- 
tation on synthetic data confirms that it is robust enough. In whole, it is a good 
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Figure 3. Translational (left) and rotational (right) errors rela- 
tive to Gaussian noise. Default conditions. Each point is a median 
of 10^ trials 

alternative to the existing five-point solvers. Its major advantage is that it yields 
a direct structure recovery, i.e. a reconstruction without computing an essential 
matrix. Such approach is more flexible when we are given some additional informa- 
tion on the camera rotations and/or translations. For instance, if the Euler angles 
{Lp,9,ip), representing matrix R, are known to lie in some limits, then so is the 
variable 

w = -2cot(v3-f V)- (17) 

This allows one to discard some roots of the 10th degree polynomial VV at once 
without structure recovery step. 
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